Model for Non-Gaussian Intraday Stock Returns 
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Stock prices are known to exhibit non-Gaussian dynamics, and there is much interest in under- 
standing the origin of this behavior. Here, we present a model that explains the shape and scaling of 
the distribution of intraday stock price fluctuations (called intraday returns) and verify the model 
using a large database for several stocks traded on the London Stock Exchange. We provide evi- 
dence that the return distribution for these stocks is non-Gaussian and similar in shape, and that 
the distribution appears stable over intraday time scales. We explain these results by assuming the 
volatility of returns is constant intraday, but varies over longer periods such that its inverse square 
follows a gamma distribution. This produces returns that are Student distributed for intraday time 
scales. The predicted results show excellent agreement with the data for all stocks in our study and 
over all regions of the return distribution. 
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It is well-known that the probability distribution of 
stock returns is non-Gaussian[l|, The distribution is 
fat tailed, which means that extreme price movements 
occur much more often than predicted given a Gaus- 
sian model. There is considerable interest in determining 
the origin of non-Gaussian returns, and a large number 
of recent papers on the subject have been written by 
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EHHlMlill. These studies often attempt to fit the 
shape of the return distribution and to determine how it 
scales in time. The shape and scaling of the distribution 
are important because they provide information about 
the underlying process that is driving asset prices. In 
addition, understanding the true distribution of returns 
is important for asset allocation, risk management, and 
option pricing. 

In this paper, we present evidence that the return 
distribution for stocks is non-Gaussian, similar across 
stocks, and stable in shape for intraday time scales. We 
show that these results are due to specific properties 
of the scale of individual returns (called volatility), and 
that the similarity of these properties across stocks allows 
for their return distributions to collapse onto one curve. 
This work is related to the large literature on stochastic 
volatility models [24|, and specifically to one of the orig- 
inal papers that suggested such a model to explain the 
non-Gaussian behavior of ret urns (23|. In that paper, the 
return distribution was assumed to be a mixture of Gaus- 
sian distributions with variances that are inverse gamma 
distributed - this produces returns that are Student dis- 
tributed. Here, we extend this result by assuming that 
volatility is slowly varying. This produces returns that 
are Student distributed throughout intraday time scales. 

Two explanations for the non-Gaussian shape of the 



return distribution are often mentioned in the liter- 
ature. Our model is an example of the mixture-of- 
distributions hypothesis, which states that return dis- 
tributions are a mixture of Gaussian distributions with 
different variances 0, [2^ 24, [2^, [2^. Several papers 
have suggested different explanations for why the vari- 
ance changes. The most popular explanation is that fluc- 
tuations in the rate of trade underlie these changes [lil 
24 . 25I 27 . 28j . Here, we measure time in increments of 
events rather than in clock increments and show that the 
return distribution exhibits interesting properties with- 
out considering the rate of trading. This is supported 
by previous work that reports fluctuations in the size of 
returns dominate those in trading^, HI, 3, 12 1- 



The other explanation for the non- Gaussian shape of 
the return distribution is known as the stable Paretian 
hypothesis - this states that returns are pulled indepen- 
dently and identically from a stable or truncated stable 
distribution P, 0]. Although a non-Gaussian stable dis- 
tribution can also be described as a mixture of Gaus- 
sian distributions with different variances [2^, the stable 
Paretian hypothesis is considered a separate hypothesis 
because it explains how the return distribution can re- 
tain its non-Gaussian shape for long time intervals with- 
out violating the assumption of independent and identi- 
cally distributed (HD) returns. Here, we show that the 
apparent stability of the non-Gaussian shape is not due 
to a stable distribution but instead is due to a slowly 
fluctuating volatility, which violates the HD assumption. 
This is supported by previous work that reports shuffling 
volatility removes the fat tails of the return distribution 
for longer time scales (3. [29j. 

To begin our analysis, we define the t*^ return as the 
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difference in logarithmic price from time t to time t + t, 



rt(r) = lii{pt+r) - In(pt), 



(1) 



where the price pt is the midpoint price between the best 
bid price and offer price in the market (these prices are 
known as quotes). There are several ways to set the unit 
of the time index, t. Here, we study returns over the 
finest possible time scale, which we call midpoint time. 
In midpoint time, t is updated whenever an event causes 
a change in the midpoint price. 

We model individual returns, rt(T = 1), as a discrete 
time stochastic process with a fluctuating variance. 



(2) 



where is an IID Gaussian A^(0, 1) random variable 
and is the local variance of the process (at is the 
standard deviation of returns at time t and is com- 
monly called volatility) . We neglect any drift for returns, 
which is small on the time scales we study here. In the 
econometrics literature, Eq. [2] is the standard form for 
an autoregressive conditional heteroskedasticity (ARCH) 
modeljsO]. Such models can be interpreted as a diffusion 
process with a time dependent diffusion parameter (in 
our case D = af)fi\.\l^. 

As originally noted by Mandelbrot [1| , at slowly varies 
in financial markets. This property is now commonly 
called clustered volatility^^, and its cause remains un- 
known. In our model, we assume that at is sufficiently 
slow varying, such that we can treat it as a constant 
over intraday time scales. The consequences of this as- 
sumption are discussed and empirically validated later. 
Replacing at with its local constant value a, individual 
returns can be approximated, rj « a^f We define the 



variable /? as the inverse squared volatility, f3 ■. 
that the return distribution can be written, 
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Therefore, within our model, the distribution of returns 
on any single day is a Gaussian with variance 1//3 = cr^. 
Because (3 can vary at longer time scales, the return dis- 
tribution observed with data pulled from many different 
days is obtained by marginalizing over (3, 



V{r,T)= / P{r,T\p)f{p)df3, 



(4) 



where /(/3) is the distribution of /3. This mixture-of- 
distributions formulation was originally suggested sev- 
eral decades ago to explain the non-Gaussian shape of 
the return distribution. As presented here, it is similar 
to the recent field of superstatistics, where the statistics 
of physical systems are separated by different time scales 
and stationary distributions are derived from the super- 
position of these statistics 12, 3ll. [3^. 



Motivated by the empirical data below, we assume that 
f3 is gamma distributed, /(/3|a,6) = r^j/^^'^e This 
is consistent with previous empirical work that reports 
that at is inverse-gamma distributed [33|. There are sev- 
eral economic explanations for why the inverse variance 
might have this distribution 3J, |35| , which we discuss 
later. In more general terms, a gamma distribution is 
one of several distributions with positive support that 
can be derived from universality arguments [36|. Carry- 
ing out the marginalization above gives the following for 
the distribution of returns, 



V{r, 



r[a+(l/2)] 1 



r[a] V2^ 



r 
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[a+(l/2)] 



_ , (5) 

which is a variant of the Student's i-distribution. Note 
that the return distribution is determined solely by the 
two parameters (a and b) from the distribution of the 
inverse squared volatility, /3, and that it remains a Stu- 
dent's i-distribution for different r. The distribution ap- 
pears stable, despite being outside the stable regime, be- 
cause volatility is assumed constant for these time scales. 

To facilitate the presentation of the empirical results 
below, we define the following normalized variables: 



r = 

r = 
-p* _ 



rtir)/^ 

n{T)h 

(A'P)l/[a+(l/2)]^ 




(6) 
(7) 
(8) 



where A = V27rr[a]/r[a + (1/2)]. These normalizations 
allow results for different time scales and different stocks 
to collapse on a single curve. 

To test the above model, we present empirical results 
for 5 stocks traded on the London Stock Exchange (LSE) 
from the period May 2, 2000 to December 31, 2002. 
There are 675 trading days during this period. The stocks 
are AstraZeneca (AZN), Lloyds TSB (LLOY), Prudential 
Pic (PRU), Reuters (RTR), and Vodafone Group (VOD). 
Our dataset contains information about the complete on- 
book market - including all on-book transactions, order 
placements, and cancellations [4l|. We truncate the first 
30 minutes of market activity to remove the effects of 
price discovery at the beginning of the day. In all of 
the results for aggregate numbers of events, we choose 
nonoverlapping intervals. We only consider intraday re- 
turns and do not include returns measured across days. 

In the plots below, we compare empirical results to 
those predicted by the above model. Others have suc- 
cessfully fit returns to a Student's t-distribution (called a 
q-Gaussian or Tsallis distribution in some papers) d, d, 0, 
E [11 El S El, 113. Note that we do not fit the return 
distribution here, but instead determine the two parame- 
ters, a and 6, from a maximum likelihood estimate given 
the daily /3's These parameters then set the return 
distribution for intraday time scales. 
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FIG. 1: Several plots for the stock AstraZeneca (AZN). (a) The probability density of daily /3 fit by a gamma distribution, 
(b) The probabihty density of ^* for different r compared to A''(0, 1). (c) The probability density of returns for different r 
compared to theory, (d) The cumulative distribution of returns for different r compared to theory. 



In Fig. [T] we present results for the stock AZN; al- 
though not shown, the results for the other stocks in our 
study are similar in appearance. In Fig. [ija), we plot 
the probability density function of /3. We overlay the 
plot with the best-fit gamma distribution and we report 
the parameters for this fit in the figure legend and also 
in Table [J In Fig. [Hb), we show the probability den- 
sity of ^* for r = 40 to r = 640 in log-log coordinates. 
This is compared to a normal distribution with zero mean 
and unit variance - which is assumed in our model. At 
time scales shorter than r = 40, which are not shown, 
the distribution of ^* is leptokurtic but with finite vari- 
ance. As seen in the figure, the distribution approaches 
a Gaussian for time scales, r > 40. That £,* is Gaussian 
distributed was also reported for daily time scales in[38t. 
In Figlljc) we plot the scaled return probability density 
for T = 10 to T = 640 in semi-log coordinates. Using 
the parameters a and 6, we predict the full probability 
distribution of returns as derived in Eq. [5] and overlay 
this prediction on the plot. We focus on the tails of the 
distribution in Fig[ljd) by plotting the scaled cumulative 



distribution function for the unsigned returns -F(|r*|) in 
log-log coordinates. As seen in both plots, the distri- 
butions collapse both in the central region and in the 
tails and are well described by the predicted curve. In 
our model, the collapse occurs because volatility is as- 
sumed constant intraday. For comparison purposes we 
fit a Gaussian distribution to the data for t — 80 and 
plot this in Figs[ljc,d). 



Security 


Events 


Events/Min 


a 


b 


AZN 


962516 


3.0 


2.7 


.44 X 10"'' 


LLOY 


746845 


2.3 


3.4 


1.1 X 10-'' 


PRU 


583792 


1.8 


2.6 


1.5 X 10"" 


RTR 


653915 


2.0 


3.9 


3.6 X 10"® 


VOD 


770352 


2.4 


3.9 


2.1 X 10"® 



TABLE I: Table of parameters for the five stocks studied. 

In Fig.[2l^a), we plot the empirical cumulative distribu- 
tion of (3 versus the fitted cumulative distribution for all 
5 stocks. This plot is created by first fixing the value of 
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FIG. 2: (a) The cumulative distribution of P compared to the cumulative distribution from the best fit to a gamma distribution 
for all the stocks in our study, (b) The normalized probability density of returns (with r — 80) compared to theory for all the 
stocks in our study. 



the fitted F{-), calculating (3 at this point, and then plot- 
ting the value of the empirical F{-) for this (3. The plot 
is similar to a Q-Q plot - when the empirical distribu- 
tion follows the fitted distribution exactly, the curve will 
lie on the 45° line. In Fig. [2I^b), we plot the normalized 
probability density 'P*{r*) with r = 80 for the five stocks 
in our study. The data from all five stocks collapse on 
the curve. 

Taken together, these results suggest that slow, sig- 
nificant fluctuations in volatility produce the interesting 
features of the intraday return distribution. In this pa- 
per, we do not provide an explanation for why volatil- 
ity has the properties that we have assumed. We note, 
however, that there exists a general class of stochastic 
volatility models that produce volatilities that are in- 
verse gamma distributed 3J| . The generalized ARCH 
(GARCH)(l,l)f39'l model and the 3/2 model are two spe- 
cific examples. Variations of the GARCH(1,1) model can 
be motivated by simple feedback mechanisms for volatil- 
ity and have been shown to produce similar results to our 
empirical findings [gI. [ssj. The 3/2 model is known to be 
a by-product of a one-dimensional diffusion equation for 
prices: a squared Bessel process of dimension fourfs?]. 
This process describes the dynamics of a growth optimal 
portfolio with deterministic drift and can be motivated 
by straightforward economic arguments 34 , 4^ . 

In this paper, we have presented a model for individual 
stock returns that reproduces the shape and scaling of the 
intraday return distribution for a collection of stocks from 
the London Stock Exchange. Our model decomposes in- 
dividual returns into the product of two terms: a Gaus- 
sian term and a slowly varying volatility term. On any 
single day, volatility is relatively constant so that returns 
are well described by Gaussian fluctuations. Across many 
days, however, fluctuations in the volatility term domi- 
nate and lead to a non-Gaussian distribution for returns. 



The resulting distribution ~ a Student's i-distribution - 
appears stable for short to intermediate time scales de- 
spite being outside the stable regime. This occurs be- 
cause volatility is slowly varying and therefore not IID. 
Thus, we can explain both the shape and apparent stabil- 
ity of the distribution, two results that previously seemed 
to contradict one another and that individually could be 
used to support one or the other competing hypothe- 
ses for non-Gaussian returns. Finally, we find that the 
statistical properties of volatility for different stocks are 
similar, allowing for a single representation of the return 
distribution for intraday time scales. 

This work was supported by the US National Science 
Foimdation Grant HSD-0624351. 
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